Gradients of Counterfactuals

نویسندگان

  • Mukund Sundararajan
  • Ankur Taly
  • Qiqi Yan
چکیده

Gradients have been used to quantify feature importance in machine learning models. Unfortunately, in nonlinear deep networks, not only individual neurons but also the whole network can saturate, and as a result an important input feature can have a tiny gradient. We study various networks, and observe that this phenomena is indeed widespread, across many inputs. We propose to examine interior gradients, which are gradients of counterfactual inputs constructed by scaling down the original input. We apply our method to the GoogleNet architecture for object recognition in images, as well as a ligand-based virtual screening network with categorical features and an LSTM based language model for the Penn Treebank dataset. We visualize how interior gradients better capture feature importance. Furthermore, interior gradients are applicable to a wide variety of deep networks, and have the attribution property that the feature importance scores sum to the the prediction score. Best of all, interior gradients can be computed just as easily as gradients. In contrast, previous methods are complex to implement, which hinders practical adoption.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Which Counterfactuals Matter? A Response to Beck

In our article (Weisberg & Gopnik, 2013), we construct a unifying theory of imaginative processes, in which counterfactuals and pretend games share an underlying cognitive capacity. This theory allows us to explain why children pretend, why they create fantastical pretend worlds, and how pretend play and counterfactual reasoning can help children to learn about the causal structure of reality. ...

متن کامل

Time-Symmetrized Counterfactuals in Quantum Theory

Counterfactuals in quantum theory are briefly reviewed and it is argued that they are very different from counterfactuals considered in the general philosophical literature. The issue of time symmetry of quantum counterfactuals is considered and a novel time-symmetric definition of quantum counterfactuals is proposed. This definition is applied for analyzing several controversies related to qua...

متن کامل

What-If at Waterloo. Carl von Clausewitz’s use of historical counterfactuals in his history of the Campaign of 1815

In this article, I analyze the use of historical counterfactuals in the Campaign of 1815 by Carl von Clausewitz (1780–1831). Such is the importance of counterfactuals in this work that its gist can be given in a series of 25 counterfactuals. I claim that a central role is played by evaluative counterfactuals. This specific form of counterfactuals is part of a didactic method that allows Clausew...

متن کامل

The day after an electoral defeat: counterfactuals and collective action.

An intriguing question for scholars of collective action is how participants of unsuccessful actions become re-engaged in future collective activities. At an individual level, previous research has shown that after negative outcomes counterfactual thoughts ('if only … ') may serve to prepare for future action. In the current research, we investigated whether counterfactuals may also prepare for...

متن کامل

The logic of counterfactual analysis in case-study explanation.

In this paper, we develop a set-theoretic and possible worlds approach to counterfactual analysis in case-study explanation. Using this approach, we first consider four kinds of counterfactuals: necessary condition counterfactuals, SUIN condition counterfactuals, sufficient condition counterfactuals, and INUS condition counterfactuals. We explore the distinctive causal claims entailed in each, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1611.02639  شماره 

صفحات  -

تاریخ انتشار 2016